Model Video Visualizations
Note: If the videos are not shown properly, please try extracting the zip file and open this webpage from the unzipped folder.
1. Model Visualization on AVSyncD Dataset
The videos are arranged from left to right as follows: KeyVID, KeyVID-Uniform, AVSyncD, and DynamiCrafter.
2. Open-Domain Generation Visualization with Audio Synchronization
Note: Please turn on the volume when playing the videos.
The first audio clip sounds like a hammer striking on a wooden surface, and the second represents four hammer strikes on a metal object.
The results show that our model not only generates videos with the correct pattern of hammer strikes but also hits on different objects based on the material sound.